Enhancement of historical printed document images by combining Total Variation regularization and Non-local Means filtering

نویسندگان

  • Laurence Likforman-Sulem
  • Jérôme Darbon
  • Elisa H. Barney Smith
چکیده

This paper proposes a novel method for document enhancement which combines two recent powerful noise-reduction steps. The first step is based on the total variation framework. It flattens background grey-levels and produces an intermediate image where background noise is considerably reduced. This image is used as a mask to produce an image with a cleaner background while keeping character details. The second step is applied to the cleaner image and consists of a filter based on non-local means: character edges are smoothed by searching for similar patch images in pixel neighborhoods. The document images to be enhanced are real historical printed documents from several periods which include several defects in their background and on character edges. These defects result from scanning, paper aging and bleedthrough. The proposed method enhances document images by combining the total variation and the non-local means techniques in order to improve OCR recognition. The method is shown to be more powerful than when these techniques are used alone and than other enhancement methods.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A mask-based enhancement method for historical documents

This paper proposes a novel method for document enhancement. The method is based on the combination of two state-of-the-art filters through the construction of a mask. The mask is applied to a TV (Total Variation) regularized image where background noise has been reduced. The masked image is then filtered by NLmeans (Non Local Means) which reduces the noise in the text areas located by the mask...

متن کامل

Document Analysis And Classification Based On Passing Window

In this paper we present Document analysis and classification system to segment and classify contents of Arabic document images. This system includes preprocessing, document segmentation, feature extraction and document classification. A document image is enhanced in the preprocessing by removing noise, binarization, and detecting and correcting image skew. In document segmentation, an algorith...

متن کامل

Combining Total Variation and Nonlocal Means Regularization for Edge Preserving Image Deconvolution

We propose a new edge preserving image deconvolution model by combining total variation and nonlocal means regularization. Natural images exhibit an high degree of redundancy. Using this redundancy, the nonlocal means regularization strategy is a good technique for detail preserving image restoration. In order to further improve the visual quality of the nonlocal means based algorithm, total va...

متن کامل

Image Deblurring Via Total Variation Based Structured Sparse Model Selection

Retina imaging technology is an effective control method for early diagnosis and early treatment of the diabetic retinopathy. In this paper, a fast robust inverse diffusion equation combining a blockwise filtering is presented to detect and evaluate diabetic retinopathy using retinal image enhancement. A flux corrected transport technique is used to control diffusion flux adaptively, which elim...

متن کامل

Total Variation as a Local Filter

In the Rudin-Osher-Fatemi (ROF) image denoising model, Total Variation (TV) is used as a global regularization term. However, as we observe, the local interactions induced by Total Variation do not propagate much at long distances in practice, so that the ROF model is not far from being a local filter. In this paper, we propose to build a purely local filter by considering the ROF model in a gi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Image Vision Comput.

دوره 29  شماره 

صفحات  -

تاریخ انتشار 2011